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by reference). Other examples of statistical analysis software available for principal-component- 
based methods include SPSS (SPSS Inc., Chicago, IL), JMP (SAS Inc., Cary NC), Stata (Stata 
Inc., College Station, TX) and Cluster. 



In the Claims 

Please delete claims 27-54 and 59-64 without prejudice. Applicants reserve the right to 
file divisional applications directed to these claims withdrawn from consideration in the present 
application. 

Pleas e amend the claims as follows. . 

1 . (Amended) A method of predicting a biological activity of a test compound, 
comprising: \ 

obtaining spectral data for the test compound and for a training set of compounds having 
known biological activities; 

scaling the spectral data of the training set of compounds prior to deriving a pattern of 
spectral data associates with the biological activity; 

: deriving the pattern of spectral data associated with the biological activity from the 
spectral data of the training set of compounds; and 

predicting the biological activity of the test compound by detecting, similarities between 
the pattern of spectral data\associated with the biological activity and a pattern of spectral data 
for the test compound. \ 

2. (Amended) The method of claim 1, wherein the spectral data are obtained without 
first correlating the spectral dam with corresponding structural features. 

3. (Amended) The method of claim 1, wherein the pattern of spectral data associated 
with a biological activity is derivedwithout first correlating the spectral data with corresponding 
structural features. \ 
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4. (Amended) The method of claim 1, wherein the pattern of spectral data of the 
training set is a pattern obtained by segmenting the spectral data of the training set of compounds 
into sub-spectral units. 

5. (Amended) TBe method of claim 4, wherein the pattern of spectral data of the test 
compound is obtained by segmenting the spectral data of the test compound into substantially the 
same sub-spectral units into which the spectral data of the training set is segmented. 



6. (Reiterated) The method of claim 1, wherein the spectral data is one type of 
spectral data. 

(Amended) Tlie method of claim 6, wherein the spectral data is one of nuclear 
acetic resonance, mass spectral, infrared, ultraviolet- visible, fluorescence, or 
phosphorescence data. 




8. (Reiterated) The method of claim 1, wherein the spectral data is a composite of 
different types of spectral data. 




9. (Amenafed) The method of claim 8, wherein the compositexomprises two or more 
of the group consisting of nuclear magnetic spectroscopy (NMR), mass spectroscopy (MS), 
infrared (IR) spectroscopy and ultraviolet- visible (UV-Vis) spectroscopy. 

10. (Amended) The method of claim 1 , wherein the spectral data of the test 
compound is segmented into substantially the same spectral sub-units as the spectral data of the 
training set of compounds to produce the spectral pattern for the test compound. 



1 1 . (Amended) The mdthod of claim 1 , wherein scaling comprises auto-scaling. 



12. (Amended) The method of claim 1, further comprising weighting the spectral data 
of the training set to emphasize signals that are important for determining an endpoint class of 
compounds in the training set before deriving the pattern associated with the biological activity. 
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13. 

weighting. 



(Amended) The method of claim 12, wherein weighting comprises Fisher- 



14. (Amended) The method of claim 1, wherein detecting similarities between the 
pattern of spectral data associated with a biological activity of the training set and the pattern of 
spectral data for the test compound comprises statistical pattern recognition. 



15. (Amended) Tlfe method of claim 10, wherein detecting similarities between the 
pattern of spectral data associated with a biological activity of the training set and the pattern of 
spectral data for the test compound comprises detecting relative intensities of one or more of the 
sub-units of the pattern of spectral data derived from the training set, and detecting relative 
intensities of signals associated wi^i the same one or more sub-units of a spectrum of the test 
compound. 



16. (Reiterated) The method of claim 15, wherein the relative intensities are canonical 
variate factors of the spectral data associated with a biological activity of the training set and the 
spectral signals of the test compound. 



17. (Aritended) The method of claim 1, wherein the method comprises artificial" 
intelligence pattern recognition. 




18. (Amended) A computer implemented method for predicting a biological activity 
of a test compound, comprising: 

receiving spectral oata for a test compound as input; 

receiving spectral data and endpoint data of a training set of compounds having known 
biological activities as input; 

segmenting the spectral \iata of the training set of compounds into sub-spectral units; 
scaling the segmented spectral data of the training set of compounds; 
detecting a pattern of spectral data associated with the biological activity; and 
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predicting^ the biological activity of the test compound by comparing the pattern of 
spectral data associated with the biological activity to the spectral data of the test compound to 
determine whethenthe spectral data of the test compound is similar to the spectral pattern 
associated with thelbiological activity of the training set and the test compound is predicted to 
share the biologicallactivity. 

19. (Amended) The computer implemented method of claim 18, wherein comparing 
comprises comparing with a statistical pattern recognition program. 

20. (Amended^ The computer implemented method of claim 19, wherein the spectral 
data for the test compoundVs segmented into substantially identical sub-spectral units as the 
training set spectral data, so\hat a signal within an individual sub-spectral unit is compared to 
the corresponding sub-spectral unit of the pattern. 

21. (Amended) The computer implemented method of claim 18, wherein the spectral 
data are selected from the group consisting of nuclear magnetic resonance data, mass spectral 
data, infrared data, ultraviolet-visible data, fluorescence data, phosphorescence data, and 
composites of two or more such spectral data. 

22. (Amended) The comput&r implemented method of claim 21 , wherein the pattern 
associated with the biological activity oMie training set is a set of canonical variate factors, and 
the spectral data for the test compound are compared to the canonical variate factors of the 
training set spectral data. \ 

23. (Amended) The computer implemented method of claim 22, wherein the 
biological activity is binding affinity to a hormtme receptor, and the canonical variate factors 
include peaks in sub-spectral units that are associated with hormone receptor binding of a pre- 
selected affinity. \ 



24. (Amended) The computer implemented method of claim 23, wherein the spectral 
data comprise nuclear magnetic resonance data and mass spectral data. 
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25. (Reiterated) A computer readable medium having stored thereon instructions for 
performing the actions of claim 1. 

26. (Reiterated) A computer readable medium having stored thereon instructions for 
performing the actions of claim 18. __ 



o 1 - 



55. (Amended) The method of claim 1, wherein the spectral data comprises calculated 
spectral data. 




56. (Amended) The computer implemented method of claim 18, wherein the spectral 
data comprises calculated spectral data. 

57. (Amended) The computer implemented method of claim 56, wherein the 
calculated spectral data comprises calculated nuclear magnetic resonance data. 

58. (Amended) The computer implemented method of claim 57, wherein the 
calculated nuclear magnetic resonance data comprises calculated 13 C NMR data. 

65. (Amended) The method of claim 1, further comprising: 
predicting a second biological activity of the test compound by comparing the spectral 
data of the test compound\p a second pattern of spectral data associated with the second 
biological activity to determine if the test compound shares the second biological activity, the 
second pattern derived using scaled spectral data and known endpoints of a training set of 
compounds for the second biological activity. 
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Please add the following new claims 66-90. 



—66. (Neyv) The method of claim 1, wherein the spectral data comprises l H, 13 C, 15 N, 
17 0, 19 F, 31 Por 35 ST^MRdata. 



eft 




67. (New) T^ie method of claim 66, wherein the spectral data comprises calculated 
spectral data. 

68. (New) The method of claim 57, wherein the calculated nuclear magnetic 
resonance data comprises 1 A 13 C, l5 N, 17 0, 19 F, 31 P or 35 S NMR data. 

69. (New) A methoc^for predicting a biological property of a test compound, 
comprising: 

providing biological activity data for a plurality of compounds; 
providing spectral data for the plurality of compounds; 
providing spectral data for theVest compound; 
segmenting the spectral data forahe test compound and the plurality of compounds into 

bins; 

scaling the spectral data of the plurality of compounds in the bins; 

weighting the spectral data of the plurality of compounds in the bins; 

detecting a pattern of spectral data of qje plurality of compounds in the bins that is 1 
correlated with the biological activity; and 

detecting similarities between the patternVorrelated with the biological activity and the 
spectral data of the test compound to determine if uie molecule shares the biological activity. 

70. (New) The method of claim 69, wherein scaling comprises autoscaling. 

71 . (New) The method of claim 70, wherein weighting comprises Fisher-weighting. 

72. (New) The method of claim 69, wherein weWhting comprises Fisher-weighting. 



73. (New) The method of claim 69, wherein scaling comprises variance scaling. 
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74. (New) The method of claim 69, wherein the pattern detected in the bins comprises 
a set of canonical vallate factors. 

75. (New) The method of claim 69, wherein the spectral data comprises nuclear 
magnetic resonance, maW spectral, infrared, ultraviolet- visible, fluorescence, or 
phosphorescence data. \ 

76. (New) The mWiod of claim 75, wherein the spectral data comprises I3 C NMR 
data. \ 

77. (New) The method of claim 76, wherein the 13 C NMR data comprises calculated 
13 C NMR data. \ 

78. (New) The method of cjaim 75, wherein the spectral data comprises a composite 
of two or more types of spectral data. \ 

79. (New) The method of claim a8, wherein the composite comprises 13 C NMR data 
and EI-MS data. \ 

80. (New) The method of claim 69, wherein detecting a pattern comprises statistical 
pattern recognition. \ 

Lenting into bins comprises 
0.5 ppm to 5.0 ppm. 



8 1 . (New) The method of claim 76, wherein se 
segmenting the 13 C NMR data into bins having a width fror 
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82. \(New) A computer implemented method for predicting the biological activity of a 
test compoundAcomprising: 

receivings as input spectral data for a test compound; 

receiving as input spectral data and biological activities of a training set of compounds; 
segmentinAthe spectral data of the training set into bins; 
autoscaling the spectral data of the training set; 
Fisher- weighting the spectral data of the training set; 

detecting a pattern of spectral data characteristic of an endpoint class of compounds in the 
training set of compounos; 

segmenting the spectral data for the test compound into bins; and 
predicting the biological activity of the test compound by detecting similarities between 
the segmented spectral data olf the test compound and the pattern of spectral data characteristic of 
the endpoint class to predict whether the test compound is also in the endpoint class. 

1 83. (New) The computer implemented method of claim 82, wherein detecting a 
pattern of spectral data associated with an endpoint class of compounds in the training set of 
compounds comprises calculating canonical variate factors for the bins. 

84. (New) The computer implemented method of claim 82, wherein the spectral data 
of the test compound and the spectral data ordhe training set of compounds comprise spectral 
data selected from the group consisting of nuclear magnetic resonance data, mass spectral data, 
infrared data, ultraviolet-visible data, fluorescencte data, phosphorescence data, and composites 
thereof. \ 

85. (New) The computer implemented method of claim 84, wherein the spectral data 
of the test compound and the spectral data of the training set of compounds comprises a 
composite of two or more types of spectral data and the spectral data is normalized to yield 
structure descriptors of similar magnitude from each type of spectral data. 
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